Multi-skilled Motion Control
نویسندگان
چکیده
Deep reinforcement learning has demonstrated increasing capabilities for continuous control problems, including agents that can move with skill and agility through their environment. An open problem in this setting is that of developing good strategies for integrating or merging policies for multiple skills, where each individual skill is a specialist in a specific skill and its associated state distribution. We extend policy distillation methods to the continuous action setting and leverage this technique to combine expert policies, as evaluated in the domain of simulated bipedal locomotion across different classes of terrain. We also introduce an input injection method for augmenting an existing policy network to exploit new input features. Lastly, our method uses transfer learning to assist in the efficient acquisition of new skills. The combination of these methods allows a policy to be incrementally augmented with new skills. We compare our progressive learning and integration via distillation (PLAID) method against three alternative baselines.
منابع مشابه
Optimization of an energy based bi-objective multi skilled resource investment project scheduling problem
Growing concern in the management of energy due to the increasing energy costs, has forced managers to optimize the amount of energy required to provide products and services. This research integrates an energy-based resource investment project-scheduling problem (RIP) under a multi-skilled structure of the resources. The proposed energy based multi skilled resource investment problem (EB-MSRIP...
متن کاملAn Evolutionary Algorithm Based on a Hybrid Multi-Attribute Decision Making Method for the Multi-Mode Multi-Skilled Resource-constrained Project Scheduling Problem
This paper addresses the multi-mode multi-skilled resource-constrained project scheduling problem. Activities of real world projects often require more than one skill to be accomplished. Besides, in many real-world situations, the resources are multi-skilled workforces. In presence of multi-skilled resources, it is required to determine the combination of workforces assigned to each activity. H...
متن کاملAssembly line balancing problem with skilled and unskilled workers: The advantages of considering multi-manned workstations
This paper address a special class of generalized assembly line balancing in which it is assumed that there are two groups of workers: skilled and unskilled ones. The skilled workers are hired permanently while the unskilled ones can be hired temporarily in order to meet the seasonal demands. It is also assumed that more than one worker may be assigned to each workstation. To show the adv...
متن کاملPlanning Arm with 5 Degrees of Freedom for Moving Objects Based on Geometric Coordinates and Color
Skilled mechanical arms of consanguine relationship formed by joints the relative motion of the adjacent interfaces enable, have been connected. Ability to perform a variety of pre-programmed robotic manipulator in various industries. Skilled mechanical arms in recent years as a significant progress has been completed. House repair and easier to work with them as well and fit and optimal relati...
متن کاملMotor control and learning: The basics of skilled instrumental performance
This paper introduces some concepts from the field of motor learning and their possible applications to the doublebass. Basic issues of motor performance and perceptual-motor integration form a proposal to enhance skilled instrumental performance. Posture and motion are analyzed under principal guidelines of human ergonomics. Movements used in doublebass performance are abridged to the concept ...
متن کاملA Multi-Objective Particle Swarm Optimization for Mixed-Model Assembly Line Balancing with Different Skilled Workers
This paper presents a multi-objective Particle Swarm Optimization (PSO) algorithm for worker assignment and mixed-model assembly line balancing problem when task times depend on the worker’s skill level. The objectives of this model are minimization of the number of stations (equivalent to the maximization of the weighted line efficiency), minimization of the weighted smoothness index and minim...
متن کامل